Hadoop Application Architectures by Mark Grover Ted Malaska Jonathan Seidman Gwen Shapira

Hadoop Application Architectures by Mark Grover Ted Malaska Jonathan Seidman Gwen Shapira

Author:Mark Grover, Ted Malaska, Jonathan Seidman, Gwen Shapira
Language: eng
Format: epub
Publisher: O'Reilly Media, Inc.
Published: 0201-07-15T00:00:00+00:00


Tip

Because these enterprise workflow automation systems are not Hadoop specific, detailed explanations of how to use each of them are beyond the scope of this book. We will focus on frameworks that are part of the Hadoop ecosystem.

Orchestration Frameworks in the Hadoop Ecosystem

There are a few workflow engines in the Hadoop ecosystem. They are tightly integrated within the Hadoop ecosystem and have built-in support for it. As a result, many organizations that need to schedule Hadoop workflows and don’t have a standard automation solution choose one of these workflow engines for workflow automation and scheduling.

A few of the more popular open source workflow engines for distributed systems include Apache Oozie, Azkaban, Luigi, and Chronos:

Oozie was developed by Yahoo! in order to support its growing Hadoop clusters and the increasing number of jobs and workflows running on those clusters.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.